fJOVEL MUTATED MAMMALIAN CELLS 

AND ANIMALS 



The present application claims priority to U.S. Application 
Number 09/728,445 which was filed November 30, 2000 which claimed 
priority to U.S. Provisional Application Number 60/168,358 which 
was filed December 1, 1999. The present application incorporates 
U.S. Patent No. 6,080,576 and U.S. Applications Ser. Nos. 
08/726, 367 , 0 : /728 , 963, 08/ 907 , 5 98, Or / 942, 806, 60/ 109, 302, and 
09/2^6,533 an i their respective disclosures heroin by reference 
in their entirety. 

1.0. FIELD OF THE INVENTION 

The present invention is in the field of molecular genetics. 
The appl ic:a t ion disc hoses novel, mutated cells that are generated 
by process involving the insertion of at least a porti on of a 
qenet i ca 1 1 y engineered viral vector into the chromosome. The 
specifically disclosed recombinant vest or allows for the rapid 
i . ;1 e n t i. f i a t i o > n o :. t n e gene t h at m s b e e n mutated b y a s i n g 
nucl.eot.ide or am: no acid sequence information to identify the 
gene that has been mutated by tne vector. Wnen mutated embryonic 
stem cell clones are produced, such cells can be used to produce 
mutant animals capable -of germ line transmission of the described 
m,i t a t ed ^enes . 

2.0. BACKGROUND OF THE INVENTION 

Most mammalian genes are divided into- exons and introns. 
hxcns are the portions of the gene that are spliced into mFXA and 
encode the protein product of a gene. In genomic DNA, these 
coding excns are often oivideo by nencc ding intron sequences. 
A 1th: ugh PNA polymerase transcribes both intron and exon 
sequences, the introri sequences must be removed from the 
transcript so that the resulting mRNA can bo- translated into 
p. rot e in. Accordingly, all mammalian, and most eukaryotic, cells 
have the machinery to splice exons to produce mF'.HA. Gene trap 
vectors have been designed to insert into the introns of genes in 



5 



10 



15 



0 



2;0 



acni 



a manner that aTTows the cellular splicing machinery to splice 
vector encoded exons to cellular mRMs . Commonly, gene crap 
vectors contain select aole marker sequences that are preceded by 
strong splice acceptor sequences and are not preceded by a 



promote r . Thus , when 



tors inteur 



into a gene, the 



on: :■ tn 



ellulur splicing machinery splices exor.s from the trapped gene 
t ' end o f ::he select aole marker se -que rice . Typically, 

selectable marker genes can -only be expressed if the vector 



encodin:; the cene has mtecrareo i nt. o an int. ron. The result in a 



Cf h 1 



rap eve n t s a r e s ub s e q uent 1 y identified by s e 1 e c L i n g f o r 



cells that can survive selective culture. 

Gene trapping has generally proven to be an efficient method 
of mutating large numbers -of genes. The insertion of the gene 
t rap vector creates a mutation in the trapped gene, and also 



r : v i de s a mo 1 e cular t a q ft r 



:i s e 



■: f i do- n t. i f. y i n g t n e g e n e t h a t 



n a 



;.: . i 



t rape e a 



When P.O. 



demo ns t rated tn 



1 t* a s t 



u s e cl 



t rap 



q e n e s it 



resulted in a phenotyr 



> w h e n 



■of tne resulting mutations 
xamined in mice. This- indicate 



t h a t t 'n e g e n e t r a p i n s ert i o n 



rs are useful mutagens. 



Although a powerful tood f:r mutatinc genes, the potential 
method has historically been limited by the difficulty in 



or the 



identifyinq the t raooed qenes. Methods that have been used to 



identify trap events rely on the fusion transcripts result! 



f rem 



t n e- s 



1 i cina 



x r 



seouences from. 



rat'pe:; ore: 



sequences encoded by the gene- trap vector. Common gene 
identification p rotocols usee to obtain sequences from these 
fusion transcripts include c 1 0ACF1, cDNA cboning, and cloning of 



genomic: ChA surrounoOing one site of vector integration:. nowever, 
tnese methods have proven laoe r intensive, not readily amenable 
to' automation, and generally impractlca 1 for h igh-th roughput . 



More recently, vectors have been developed that rely on a 



new s t r a t eqv < > 



r gene trapping that 
selectable marker gene preceded by a 



uses a vector that con:ains a 
promoter and foil owed by a 



splice donor sequence instead of a pc lyadenylat ion sequence. 





These vectors 3^^not provide selection unless thoy integrate into 
a gene and subsequently trap downstream exons which pirovide a 
polyadenylation sequence. Integration of such vectors into the 
chromosome results in the splicing of the selectable marker gene 
5 to :' exons of the trap>pe:l gene. These vectors provide a number 
of a civ an c ages . Tney car. be used to trap genes regardless of 
whether the genes are normally expressed in the cell type in 
which tne vector has integrated. In addition, cells harboring 
such vectors can be screened us i rig automated (e.c., 9 6 -we 11 plate 
10 forinat-- gene identification assays such as :'■ ' RACE (see 

generally, Erchman, 199-4, PTR Methods and Applications, 4:S40- 
S53) . Using these vectors it is possible to produce large 
numbers of mutations and rapidly identify tne mutated, or 
trapped, gene by DNA sequence analysis. 



The subject invention provioes numerous isolated mammalian 
mu t a n t cell clones that are e a c hi c Kara c :er i zed b y tne insert! c n 
of a mutagenic genetically engineered polynucleotide sequence 
20 into a gene identifiable as corresponding to- cone or more of the 
OMNIEAUK gene trapped sequences (GTSs) disclosed in Sequence 
Listing. 

Tne subject invent oon further contemplates a mutated cell, 
and particularly a mutated ES cell, and the animals derived from 
2 5 such ES cell that stably maintain a genetically engineered 

mutation in a gene identifiable as corresponding to one of the 
disclosed GTSs . 
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3.0. 



SUMMARY OF THE INVENTION 



4.0. 



DESCRIPTION OF THE SEQUENCE LISTING AND FIGURES 



30 



The Sequence 



Listing is a compilation of nucleotide 



sequences < 



obtained by sequencing clonal lines of gene trapped 



murine ES 



ceils . 
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Figures 1h p i Z present a diagrammatic representation of 
representative gene trap vectors used to generate the described 
sequences . 

Figure 2 p r o v i ■; ie s an index to the Sequence Listing a n d the 
5 co> r r es pon :.lin g database accessor! numbers for the genes that have 
been mutated according to the present invention. 



5.0. DETAILED DESCRIPTION OF THE INVENTION 

The current invention relates to novel mutated mammalian 

1 0 c -1- lis that are each characterized by the insert! :> n o f a 
r e c omb i n a n t 'i.e., g e n e t i c a 1 1. y e n g i neered] mu t a g e n 1 c 
polynucleotide sequence into a gene identifiable as corresponding 
to cne of the GTSs of SE^ ID WCS : 1 - 8 9 1 . 
For the purposes of the present invention, the term 

15 "identifiable" is to be construed as indicating that a mammalian 
cell, and pre f e raol y , a murine- F5 cell, nas been mutated by the 
insertion of a p>: iynucleoti.de sequence of re ccmbinant ly 
ma n 1 p u 1 a t e d o r i g i n a t a g e n e t i c 1 o c u s t h a t no r ma 1 1 y c :>mp rises 
poo 1 ynuo 1 eo t ide sequence, ana/or pos t . -spl iced exonic sequence, 

20 that is at least partially described in one -of the GTSs of 

Sequence Listing. One method of determining whether one of the 
described mutated mammalian cells has a mutation in a gene of 
interest is by comparing the polynucleotide sequence (or a 
C'jrresp ending ammo acid sequence) of tne GTS identifying the 

2 5 mutated Locus to the full Length sequence of the gene. 

Alternatively, such searches can be conduct eel by comparing the 
described GTS sequence t: a well known database (such as, but not 
limited to GEN3AIJK) using established computer algorithms 
including, but net limited to, ELASTX, EASTA, &LASTN, BLAST E' , 

20 TBLASTlt, and T3LASTX using the default parameters used, for 

example, at the National Center for Biot.echnol.cgy Information web 
site (www.ncbi.nlm.nih.go-v). The GTSs reported in the Sequence 
Listing have been compared to such a database (GENEANK), and the 
accession numbers of the genes that have been mutated are 
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presented in Fi^ire 2. Accordingly, an additional aspect of the 
subject invention includes mutated mammalian, preferably murine, 
cells, or isolated cell lines, that have at least one engineered 
mutation in a gene identified by GENBAdp; -or GEINESEQ (for example) 
a c o e s s i o n n u mb or i n F i. g u r e 2" . 

As used herein, one terms "mutated" or "mutation" mean that 



the uenetic Locus Lias been altered by a pr 



i. ""3; O 



involving the 



irteorat ion or incorporation of a genetically engineered 
polynucleotide sequence into the genome -of trie cell with the 
result tnat the subsequent levels of activity of ttie product 
normally encoded oy the locus is altered (i.e., reduced, 



i nc reased, or s ubs t an t i a i ly ab 1 a o ed ) 



I n t Li o s e i n s t a n c 



wnere 



t. Lie mutation substantially completely disrupts ttie expression or 
activity of tne product normally encoded by tne locus (i.e., a 
ncl L mutation) , a cell that is heterozvoou; for tne mutated 



allele will typically produce about -one half -of the product of a 
nonmutated cell i via a cene dosage effect), and about twice the 



amount of product produced by a 
mut ant all el e . 



c e 1 1. t h a t i s n o -mo z 1 



y =i 



s for the 



Trie term " r ecomb ir.a.n 1 1 y manipulated" sbiall mean tLiat such 
compositions comprisina such molecules or polynucleot idee have 
been genet i ca 1 1 y enginee red us i ng mo lecula r b i elogy met hodc- log ie s 
j n vi t rc o r ex vi.vc (see gene r a 1 1 y , 8 amio roc k et a 1 . , 19 o 9 , 
Molecular Cloning, A Laboratory Manual, Cede Springs Harbor 
Press, N.Y.; and Ausubel et ai. f 19£9, Current Protocols in 

Molecular Biology, Green Publishing Associates and Wiley 
I n t e r s c 1. e n c e , K . Y . ) . 

Where, tne specifically exemplified mammalian cells, i.e., 
embryonic stem cells (Lex-L cells from murine strain A129), are 
mutated by a process involving the insertion of at least a 
portion of a genetically engineered vector sequence into tne gene 
of interest, the mutated embryonic stem cells can be 
microinjected into blastocysts which are subsequently introduced 
into pseudop regnant female hosts and carried to term using 




established metirods such as those described in, for example, 
"Mouse Mutagenesis", 199R, Zambrowicz et al. r eds . , Lexicon 
Press, The Woodlands, TX, and periodic updates thereof, herein 
incorporated by reference. The resulting chimeric animals are 
subsequently ored to proiluco offspring capable of germline 
transmission of an allele containing the engineered mutation in 
the gene of interest. 

An alternative method -of producing mutated cells and animals 
in the specifically exemplified genes involves the process of 
gene targeting by homoloorous recombination using metnods such as 
these exemplified in d . S . Application Ser. No. 09/171,641, which 
is hereir; incorporated hy reference in its entirety. Mutations 
produced using such methods include, out are not limited to 
knockou: mutations, " kno c in" niutatioos (where a human gene, for 
example, is used to* replace its murine ortho-Logs), can be 
conditional, can include point mutations, and mutations that 
activate gene expression. Some of the mutations described above 
■'conditional mutations, point mutations, etc.) can oe produced 
via processes that involve the substantial removal of vector 
en CO' tied sequences (-often recombines mediated) subsequent to the 
incorporation of the recombinant 1 y manipulated sequences into the 
oeno'me . 



5.1. MUTATED MAMMALIAN CELLS OF THE PRESENT INVENTION 

The presently described mutated cells nave oenetically engineered 
mutations on genes identifiable- as cor respond inq to, or normally 
comprising, at least a portion of a sequence reported in the 
Sequence Listing as SEQ ID UOS : 1-391. Additional embodiments of 
the present invention are cells comprising engineered mutations 
in ruomologs, paralogs, orthologs, etc., of the mutated genes 
disclosed in the Sequence Listing. Such homoloqs, paralogs, and 
orthc legs include genes having sequences that hybridize to one or 
mere of the disclosed GTSs of SEQ IS UOS : 1-8 91 under stringent, 
o>r preferably highly stringent, conditions . Hybridization 
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conditions a Is o^5rc vi de an alternative means or identifying the 
mutated genes corresponding to the GTSs reported in the sequence 
Listing. Typically, such genes wilL be LdentifiabLe because a 
disclosed GTS, or portion thereof, shaLl hybridize to the gene 
5 under stringent: conditions. 

By way of example and not limitation, high stringency 
nyb r co i z a 1 1 -on conditions can toe defined as foLLov/s: 
Prehybridicat ion o r filters containing DXA to be screened is 
carried out for 3 h to> -overnight at 65' 1 in a buffer containing 

10 6X SSC, SOnM Tris-HCl (pH 7 . 5 j , LmM EDTA, 0.0 2;; PVP, 0 . i. >2 s 

XI col 1 , J . 1 I 33 A, an d 5 tO /.zo/ml denatured sal.rn on sperm DXA . 
Filters are hybridized for 4 5 h at 6 5 X: in prehybr.idicat.ion 
mixture containmo lOu/ug-'ml denatured salmon sperm DI1A and 5-2) x 
l- |b cpm of 3 - P-labeLed probe (alternatively, as in all 

15 hybridizat lens described nero-m, approxiruatel y 42, 44 , 4*1, 4 3 , 
50, 52, 54, 56, 58, 62, 64, 66, 6 3, 70, or about 72 degrees or 
more can be used; . The: -."'liters are then washed in approximately 
IX wash mix (10X wash mix contains .3M Mad, 0 . 6M Tris base, and 
0.02M EDTA, alternatively, as with all washes -described herein, 

20 2X, 3X, 4X, 5X, 6X wash mix, or more, can be used) twice tor 5 

m; nut es each at r-oon temperature, tnen m IX wash mix containing 
1- SI'S at 6U'"C (alternatively, as in all washes described herein, 
approximately 41, 1 4 , 4 6, 48, 50, 52, 54, 56, HI, 62, 64 , 66, 6 3, 
I 7 0 , or about 72 deorees o>r rmore can be used} tor about 10 mm, 

25 arid finally in 1 . Ill wash mix ■ alternatively, as in all imai 

washes describee; herein, approximately, 0.2X, 0.4X, U.6X, 0.8X, 
IX, or any concentration between about 2X and about 6X can be 
used m corn u no t ion wlch a sur table wash temperature) containing 
O.lo SDS at 60 -1 (alternatively, approxir.ato ly 42 , 44 , 4 6, 48 , 

10 5 U f 5 : f 5 -1 1 5 o t 5 o / 6 2 f o . ] t o, ^ f 5 y. f 10, o r a b ■ o ut 7 2 c e g rees or 
more can loe usee) for about : t min. The filters are then air 
dried and oxpesed to- x-ray £X 1m for autoradiography. In an 
alternative protocol, washmo -of filters is done for 17 C for 1 h 
in a solution containing 2X 2SC, O.Ull PVP, 0.01% Ficoll, and 
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0.01-5 BSA. Th^POis followed by a wash in 0 . bTsSC at 50 C C f-.-r -15 
min before autoradiography. Arother example of hybridisation 
under highly stringent renditions is nybrid .io.ation to filter- 
bound DNA in 0.5 K I^aHPO.,, 70 sodium dodeeyl sulfate (SDS), 1 mM 
5 XjTA at 05 C C, and washing in 0 . 1 :-:SSC/ : > . 1 0 SDS at 63°C (Ausubn-l 

F.M. o j t ul., eds. , 1939, current Protocols in Molecular Bio-logy, 
Vol. [, Green Publishing Associates, Inc., and John Wiley * sons, 
Inc., New York, at p. 2 , 1 u . 3 ) . 

Al terrtac o ve 1 y , moderately stringent conoitiens can be used (e.g., 
10 washing in 0 . 2xSSC/0 . 1 ' SDS a: 42 : C (Ausubel et ui., 1030, 

supra) . Moderately strinoie-nt conditions car. be adoli t i<ona 1 1 y 

defined, for example, as follows: Filters containing DMA are 
p recreated for 6 h at 0 0 ' : 0 in a solution containing 6X SSC, IX 
oenhart' s solution, 0.5 s E»S and 100 ug/ml denatured salmon sperm 

15 DMA. Hvijridizations are carried out i.n the same S'~> lut i-~»n and 1- 
10 x 10" cpm 2 P-labelecl probe os used. Filters are incubated in 
hybridisation mixture for 13-2u n at 55 '1 (alternatively, as in 
all hybridizations described herein, approximately 41, 44, 40, 
4c, 10, 50, 54, 50, 50, 00, 04, 00, 60, 70, on: about 70 degrees 

20 or more can be used in ooir.binat.ron with a suitable concentration 
cl salt) . The f titers are then washed in appro-xima to ly IX wash 
mix iiO>: wash max contains 3M IJaZO, 0 . 6M Iris base, and 0 . 0011 
X OTA , alternatively, as wuth all washes described herein, Oil, OX, 
4X, 5X, OX wash mix, or more, can be used) twice for 5 minutes 

25 each at roorr. temperature, then in IX wash mix co»nt;ainin^ 10 SDS 
at 60 C -.alternatively, as on all washes described herein, 
appr : ximately , 42, 44, 40, 4 o , 50, 50, 54 , =>:, =.y, 00, 04, 00, 
Or, 7 0 , or about '.'0 degrees : r more can oe used- for about 00 
ml n, and finally m 0 . ?X v:ash mix ( a Oterna t i ve ly , as in all final 

30 washes described herein approximately 0.2X, 0.4X, 0.0X, 0.8X, IX, 
or any concentration between about 2X and about OX can oe used in 
conjunction with a suitable wash temperature) containing 0.0' SDS 
an (EO'C -alternatively, approximately 42, 4 4 , 45, 4£, 50, 51, 54, 
50, 50, o-2, 64 , 00, OS, 7 0 , or about 72 decrees o>r more can be 
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used) for abou~0 min. The filters are then air dried and 
exposed to x-ray film for autoradiography. 

In an alternative protocol, washing of filters is done twice 
for 30 minutes at 60 C .in a solution containing IX SSC and 0.1% 
3 D$ . F 1 1 c e r s a r e b lotted dry a n d e xp o s ■:■ d for autoradio g r a p h y . 

Other conditions of moderate stringency which may be used 
are weLL-known in one art. :lr example, washing of filters can 
oo done at 37 o;: for 1 1; in a solution containing IX 230, 0.1.; 
S 03 . An other e x a :np 1 e o r h y b r i . cl i z a t l o n u n d e r m o deratel y s t rirgent 
conditions is wa s h 1 n g i n 0 . 2 x 2. 3 C / 0 . 1 ? 3 b 3 a o 4 0 2 0 ( Au sub o 1 e t: 
al . , l'?S9, zrupra) . Such less stringent conditions may also be, 
for example, low stringency hybridisation conditions. By way of 
example and not limitation, procedures using such conditions -of 
Low stringency are as fcdlo>ws isee also Snilo and Weinberg, 1981, 
Free. Katl. Acad. Sci. USA 7 3:6739-6792): Filters containing DNA 
are pret reared for 6 n at 4 0 ' 0 in a solution containing 151 
fc-rmarrude, 5X SSC, 50mK Tris-ilC:. (pel 7.9), 5mM F DTA , 0 . 1 . ?VP, 
0 . 1 % r'icol 1, 11 33A, and 500 ^g/ml denatured salmon sperm DNA. 
Hvbridizations are carr ied -out in the same solut ion wi th tne 
following modifications: 0.0:1; ?VP, 9. lis Fisoll, 0.9% B S A , 
100/uo/mL salmon sperm DNA, 10-; (wt/voi) dextran sulfate, and 5-20 
X 10" cpm ' 2 ?-labeled pro is used. 01 iters are incubated in 
hybridization mixture for 16-10 h at 40 C [alternatively, as on 
all hybridizations described herein, approximately 41, 44, 4 6, 
4 3 , 50, 52, 54, 50, 53, 0.1, 04, 00, 6 a , 70, or about 71 degrees 
-or mo r e can b e u s e :1 ) . Oh e f i ! t e r s a r e t h e n wash e cl i n 
approximately IX wash mix (lOx wash mix o: noains 5M XaOl, 0 . 6M 
Iris lease, and 0 . 01M EDOA, al t omat rvely , as with all washes 
described hereon, IX, 3X, 4X, 5X, IX wash mix, or more, can be 
used) twice for five minutes each at room temperature, then on IX 
wash mix containing 11 SOS at 10' C ; al t ernatrveiy , as on all 
washes descriLoed heroin, appr--.o:imat el y 42, 44, 4 6, 4=-:, 50, 



54 , 50, 53, 62, 64 , 66, 66, 7 u , c r ab«..ut 72 degrees or more can 
be used) for about 10 min, and finally in u . 3X wash mix 



( alternatively ,^^Ts in all final washes descri^j herein, 
approximately, 0.2X, 9.4X, 0.6M, 0.3>I, IX, or any concentration 



between about 



and about 6X can be used in com unction with a 



suitable wish temoerature) contair.in 3 0-2% SDS at 60 : C 



( a I te rnati vely , ac>pr in 



4 5, 4 * , 50, 50, M, 56, 58, 



, 66, 6h, 70, or about 71 decree? -or more :.:an loe use:!) for 



about 3 0 rnin. I 1 he filters 



air -dried and exoosed t.c 



) x — 



ray film for autoradiography. In yet another alternative 



protocol, washing of filters 



1 u t i.-on c ont a in inq 2 P 



■:; c ~- 



The 



V 



s ol ut ion 



■. s done for 1.5 h at 5 5 ''C in a 
ibmM Tns-HCl fpH 7.4), tmM FDTA, and 
■s rerdaced with fresh sola: ion and 



incubated an additional 1 



;0 i : ' ( 



Filters are then blotted 



dry and exposed for autoradiography. If necessary, filters are 
washeol for a third time at 6^-60 : 'C and reexpcsed to f ilm. Other 
conditions of low stringency which may be used are well known in 



he art ( e . c , 



emp-loyed : c r 



c r 



^. — ^ \ j f -' 



cies hybridizations) . 



e f e raiol y , PITS va r i a nt s i dent i f l e d 



i sol 



d us i .no: the 



a cove 



methods will also encode a f moot i : tally equivalent cone pnod 
(i.e., protein, polypeptide, :r domain thereof, encoding c>r 



t h e r wis e 



s o c ia te d w i t h a f u n -otic n o r s t r u c t u r e at 1 e a s t 



■artially encoded by the complementary GTC) . 



Low stringency :ond i 1 1 -.-us are well known to those of skill 
in the art, and will vary predictably depending on the specific, 
■organisms from which the library and the labeled sequences are 
derived. For guidance regaroino such -conditions see, for 
e o: amp 1 e , 5 amb r o c k e t a 1 . , 1 0 ■•• 9 , Mo 1 e t u 1 a r 51 o n i n g , A L a bo rat o r y 



Manual, Cold Sjorinas Harbor P r 



1 » a X 



; and Ausubel et al. f 



1 9 P 9 , Current Protocols in Molecular Biology, Green Publishing 
Associates and Wiley I nter so i ence , X.Y. 

The identification of hc-mcdogs, heterclogs, or paralogs of 
SEC lb UOS : 1-891 in other, preferably related, species can be 
useful for developing additional animal mo-del systems that are 



closely related to humans for purposes of drug discover 



-enes 



at other genetic loci within the genome that encode proteins 



which have extensive homology to one or more domains of the gene 
products encoded by SEj ID NOS : 1-891 can also be identified via 
similar techniques. In the case of cDNA libraries, such 
screening techniques z a n identify c 1 o n e s de r i ve d f r om 
alternatively spliced transcripts in the came or different 
species . 

Techniques useful to disrupt a gene in a cel. 1 and especially 
an ES cell tnat may already have a dr erupted gene are disclosed 
in copending 'J 5 patent applications Mo s . 0_:/72 6, 8 67 ; 03/728,963; 
C 8 / : 1 0 7 , 5 9 3 ; and 08/^42,806, all of which are hereby incorporated 
herein by reference in their entirety, are within the scoioe of 
the current invention to disrupt a gene that encodes a 
po> 1 ynuc le t t ide o f the current i nven t : on . 

5.2. USES OF THE DESCRIBED MUTATED GENES AND ANIMALS 

Tne described mutated cells and animals are used to 
investigate and define the cellular anol biological functions of 
tne mutated genes. Producing a scientific model that accurately 
accounts for the large number of genes, proteins, and 
ma c romolecules witni. n a single cell h a s t h u s far proved be y o n d 
the :apat i 1 i t i es -of existing computer technology. Et should thus 
not ice surprising that the far more complex task -of modeling the 
various intricacies, cross and direct redundancies, and 
interrelated functions of the various metabolic and eatabc-lic 
processes that occur within a single cell has also proven largely 
intractable to algorithmic methods of modeling and prediction. 
Even if cone assumes that computer modeling of inherently 
chaotic/heuristic processes will rapidly mature in the near 
future, such methods, at best, can -only provide predictions that 
subsequently require practical validation. Several decades of 
empirical data have proven that mutant phenotypes provide a 
valuable source of such validation. 

The mutated diploid mammalian cells of the present invention 
will initially exist as mutated dip-loid cells that are 



'^^^ept where qenes on the X or^d^: 



a r e 



heterozyg >us (~ept where genes on the X or r chromosomes are 
mutated) tor the mutations identified in the sequence listing. 
As such, via a "gene dosage" effect, the mutated cells can 
typically be characterized by the fact that they produce about 
one hell of tne mutated transcrip: / activity relative to cells 
having two nonmutated or wild tyre copies of the corresponding 
ge ne . 

When nut. an v animals are produced from one mutated cells, 
het.er:c\gous animals capable of germline transmission of the 
mutated allele can be bred to produce embryos or offspring that 
re homozygous for the mutant allele. Such animals or embryos 
a r 1 h s c u r c e o f tissues a n d '.cells t h a z d o not e x press 
physiologically relevant amounts of the mutated genes or 
act i v o o c os encode d z he reioy . A see r ding 1 y , an add o t bona 1 
e mb o d i me n t of t he present 1 io v e n 1 1 ■ o n are mutant cells and a n i m a 1 s 
that have homozygous mutations in genes identifiable as 
cor resp'ono ing to the GEIJEAIJK, or onher database accession, 
numbers provided on Figure 1', or are identifiable as a hcmologs, 
paralog, or orthol:gs of a sequen-.ee provider in the Sequence 
Listing. 

In aclditirri to providing imp-ortant information regarding the 
functional rede of a given gene in its nonmutated state (i.e., 
you learn about the function of the gene by discerning the 
effects of reducing or aerating trie activity normally encoded by 
the gone-, the described mutoto-o cells and animals can be used as 
disease models, or on assays for conpounas or genes (via gene 
delivery or transgenic: methods- that compensate for the mutant 
phenotypc and that can be used zd treat diseases and disorders 
related to the observed phenooype. Alternatively, such products 
and genes can also be used to enhance do s livable, if not normal, 
symptoms related to the observed phenotypes. 

The 'gene replacement / do^ i ver y therapies described at ove 
should be capable of delivering gene sequences to the cell types 




within patientsT/hich express the peptide or protein having the 
desired activity. 

The examples below are provided to illustrate the subject 
indention. These examples are provided by way of illustration 
and are not included for the purpose of limiting the invention in 
any way whatsoever. 

6.0. EXAMPLES 

6.1. GENERATION OF A LIBRARY OF MUTATED MOUSE ES CELLS DEFINED BY 

GTS SEQUENCES 

The retroviral vector VloTF: I, described in detail in U.S. 
application Ser. Mo. 0 : : /72 : ! , 9c 3 , filed October 11, 1996, was used 
to generate a library of gene trapped ES cell clones that 
represent a portion of. the described GTSs. A plasmicl containing 
the V1CTR 3 cassette was constructed by conventional cloning 
techniques and designed to employ the features described above . 
Namely, the cassette contained a PGK promoter directing 

transcription of an exon that encodes the puro marker and ends in 
a -.canonical splice donor sequence. At the end jf the puromycin 
exon, sequences were added as described that allow for the 
annealing of two nested PCK end sequencing primers. The vector 
backbone was based on pBluescript KS+ from Stratagene 
Corpora t ion . 

The olasmid construct was linearized by digestion with Sea I 
wnoch cuts at a urtio{ue site in the plasmia backbone. The plasmid 
was tnen trans fee ted into the mouse ES cell line A32 . 2 by 
e lect roporat ion using a BioRad Genepulser apparatus. After the 
cells were allowed to recover, gene trap clones were selected by 
adding puromycin to the medium at a final con cent rat i on of 3 
yug/ml . Pcsitive clones were allowed to grow under selection for 
approximately 10 days before being removed and cultured 
separately for storage and to determine the sequence of the 
disrupted gene . 



Total RNA^^is isolated from an aliquot oT" cells from each of 
13 gene trap clones chosen for study. Five micrograms of this 
RMA was usecl in a first strand cDMA synthesis reaction using the 
"RS" primer. This primer has unique sequences (for subsequent 
RJR ) on its 5 1 end and nine random nucleosides or nine T 
'thymidine) residues :>n it's 3' end. Reaction products frcm the 



firs: strand synthesis were added dire 



4- 1 . 



t a ? 2 R w i t h outer 
primers specific for the engineered sequences of puromycin and 
the "RS" primer. After amplification, an aliquot of reaction 
przducts were subject to- a second round of amplification using 



primers internal, or tested, relative to the first set of ?CF:. 
primers. This second amplification provided more reaction 
product for sequencing and als: provided increased specificity 
for the soecif icallv crene crapred Lcl A . 

The products of the nested ?CR were visualized by agarose 
je 1 electrophoresis, and seventeen of the eighteen clones 



r< r "> v 



vided at least 



ne 



a n ol t :o a t w a s v i s i b 1 e e > n the gel w i t h 



ethidium bromide staining. Most 



o n 1 y a sin u 1 e b a n d wh i ch i s 



3:1 ci 



■ i 



v a n t a g e L n that a single rand is generally e a s i e r 
sequence. The RCR products were sequenced directly .after excess 
PCR primers and nucleotides were removed by filtration in a spin 
cc'lumn ( Cent r icon- 1 00 , Micon) . DMA was added directly to -dye 
terminator sequencing reactions (purchased from ABI) using the 
standard XI 3 forward primer a region for which was built into the 
end cf the puro ex:n in all of the FCR fragments. 

Subsequent studies nave used both VICTR 3 and VICTR 20. 
Lioe VICTR 3, VICTR 20 is exemplary of a family of vectors that 
incorporate two main functional units: a sequence acquisition 
component having a strong promoter element ( phospho>g L ycerate 
kinase 1) active in ES cells that is fused to the puromycin 
resistance gene (or other exon sequence.' that is followed by a 
synthetic consensus splice dcnor :SD) sequence and lacks an 
operatively positioned pol y a deny lat ion sequence downstream from 
the SD sequence ( E'GKpurc-SD) ; and Ij a mutagenic component that 



a^L^> 



incorporates a splice acceptor sequence fuse cT^o a selectable 
and,'or colorimetric marker gene and followeei by a pelyadenylation 
sequence (for example, 3A; : qe ,pA, SAneopA, SAI RES neopA, or 
SAIRESf. geopA) . 

Also like VICTR 3, stop cod oris nave been engineered into all 



t to r e e r e a d i n o frame s 



n 



ie 



3 i on between the 



eri 



of the 



select aole marker and the sol ice donor site. A diacjrammacic 



description of structure and functions -of VITTFs 



and I 



1 s 



or-oviaed in Figure 1. 



When VIcTPs 



nd various variations thereof such 



the vectors ana methods describe-:! in U.S. Applications Ser. Kos. 
0 o 1 / c 7 €■ , o ? 0 , a n d 61'/ C 1 9 o , o ? :'■ ( o h e cl iscl o s u r e s of w h i c h are h erei.n 



incorporated by refer e n c e > , w ere a 



in the commercial scale 



a p p ligation of the p r e s e n 1 1 y d i s ■:: 1 o s e d i r. v ention, mi a n y 

mat agoni zed F5 cell clones were rapidly engineered and obtained. 

Sequence analysis obtained from these clones has identified a 



W I 



SFO I 



riety of sequences. Each o-f the sequences presented m 



: 1 - 1 0 1 identif',* n-.-vel ma tat ions 



O 1 >" 



th 



ma regions 
oo the 



of mammalian genes that lelaiv. i fiahle as corresponding to 
sequences presented in the Sequence Listing. Alternatively, the 
■described mutated cells are described by one database (GFNBAhP', 



GENSEQ, etc. ) accession numbers 



tk- 



c i 1 r r '0 s 



ndirg 



t hat 



have been mutated (see Figure 1) . The described mutated cells, 
and preferably FS cells, prootide a valuable resource for 
defining, evaluating, or validating the kic-lc-gical function or 
disease/pharmaceutical relevance -of each of these genes. 

The cloned 3' FACE produces resulting after the target FS 
cells were infected with one of one described gene trap vectors 
were.' purified using conventional column chrc-matc-graphy, (e.g. , 
5 30 0 and G-5 0 columns), and the products were recovered toy 
cent, ri ruga 1 1 on . Purified PC? products were quantified by 
fluorescence using PiccGreen (Molecular Probes, Inc., Eugene 
Oregon.i as per the manufacturer's instructions . 



# • 

nSfor cycle socfuencin j reactions w 



Dye terminSfor cycle sequencing reactions with AmpliTaq© FS 
DNA p jlymerase (Perkin Elmer Applied Biosystems, Foster City, CA) 
were carried out using approximate 1 y 7 pmoles of sequencing 
primer, and approximately 50-lOJ ng of 1' template. 
Unincorporated dye terminators were removed f r cm the completed 
sequencing reactions using G-5o columns as described above. The 
reactions were dried under vacuum, resuspended in loading buffer, 
and elect rophc rese d through a k% Long Ranger acrylamide gel ( FHC 
BioProducts, F.ockland, M£) on an A3I Prisml) 3-77 with hL upgrade 
as per the manufacturer's instructions. The sequences of the 
resulting ampl icons, or GTSs, are described in SFo ID N03 : 1-3 91. 

All publications and patents mentioned in the above 
specification are herein incorporated by reference. Various 
modifications and variations of the described method and system 
of trie invention will be apparent to these skilled in the art 
without departing from the scope and spirit of the invention. 
Although the invention has been described in connection with 
specific preferred embodiments, it should be understood that the 
invent ion as claimed should not oe unduly limited tc such 
specific embodiments . Indeed, various modifications -of the 
above-described modes for carrying out the invention which are 
obvious to those skilled in the field of molecular biology or 
related fields are intended to be within the scope of the 
following claims. 



